Three dimensional stacked nonvolatile semiconductor memory in which the channel potential of a memory cell in a non-selected NAND cell unit is increased

ABSTRACT

A three dimensional stacked nonvolatile semiconductor memory according to an example of the present invention includes a memory cell array comprised of first and second blocks. The first block has a first cell unit which includes a memory cell to be programmed and a second cell unit which does not include a memory cell to be programmed, and programming is executed by applying a program potential or a transfer potential to word lines in the first block after the initial potential of channels of the memory cells in the first and second cell units is set to a plus potential. In the programming, the program potential and the transfer potential are not applied to word lines in the second block.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of and claims the benefit of priority under 35 U.S.C. § 120 from U.S. application Ser. No. 16/251,419 filed Jan. 18, 2019, which is a continuation of U.S. application Ser. No. 15/916,332 filed Mar. 9, 2018 (now U.S. Pat. No. 10,224,106 issued Mar. 5, 2019), which is a continuation of U.S. application Ser. No. 15/367,361 filed Dec. 2, 2016 (now U.S. Pat. No. 9,953,708 issued Apr. 24, 2018), which is a continuation of U.S. Ser. No. 14/994,203 filed Jan. 13, 2016 (now U.S. Pat. No. 9,558,833 issued Jan. 31, 2017), which is a continuation of U.S. Ser. No. 14/191,172 filed Feb. 26, 2014 (now U.S. Pat. No. 9,275,737 issued Mar. 1, 2016), which is a continuation of U.S. Ser. No. 13/738,633 filed Jan. 10, 2013 (now U.S. Pat. No. 8,681,551 issued Mar. 25, 2014), which is a continuation of U.S. Ser. No. 13/336,122 filed Dec. 23, 2011 (now U.S. Pat. No. 8,379,449 issued Feb. 19, 2013), which is a continuation of Ser. No. 12/953,690 filed Nov. 24, 2010 (now U.S. Pat. No. 8,102,711 issued Jan. 24, 2012), which is a continuation of U.S. Ser. No. 12/407,094 filed Mar. 19, 2009 (now U.S. Pat. No. 7,859,094 issued Dec. 28, 2010), and claims the benefit of priority under 35 U.S.C. § 119 from Japanese Patent Application No. 2008-112659 filed Apr. 23, 2008, the entire contents of each of which are incorporated herein by reference.

BACKGROUND OF THE INVENTION 1. Field of the Invention

The present invention relates to a three dimensional stacked nonvolatile semiconductor memory.

2. Description of the Related Art

BiCS (Bit Cost Scalable) technology is known as a technology for suppressing a bit cost of a semiconductor memory by increasing the capacity thereof by a three dimensional structure (refer to, for example, “Bit Cost Scalable Technology with Punch and Plug Process for Ultra High Density Flash Memory” 2007 Symposium on VLSI Technology Digest of Technical Papers. p. 14).

A nonvolatile semiconductor memory to which the BiCS technology is applied (hereinafter, called a BiCS memory) has a feature in that it not only has a three dimensional structure but makes bit cost scalability possible so that a bit cost can be reduced in proportion to an increase of the number of stacked layers by devising a device structure and a process technology.

In, for example, a NAND flash memory to which the BiCS technology is applied (hereinafter, called a BiCS-NAND flash memory), a memory capacity, which greatly exceeds the limit of the memory capacity of a NAND flash memory having a two-dimensional structure, can be realized by increasing the number of cells in a longitudinal direction which comprise a NAND column by increasing the number of stacked layers.

However, since the BiCS memory which is represented by a BiCS-NAND flash memory has a unique device structure, there are many problems to be solved to practically use the BiCS memory.

A program disturb is exemplified as one of the problems.

The BiCS memory has such a feature that cell units are included in one block connected to one bit line. Further, the cell units cannot be selected at the same time from the viewpoint of a circuit operation. Accordingly, a non-selected cell unit which does not include a memory cell to be programmed exists in a selected block.

This problem does not occur in a flash memory having a two dimensional structure.

Therefore, program disturb must be examined to prevent a variation of a threshold voltage of a memory cell in a non-selected cell unit in a selected block in programming.

In particular, in the BiCS memory, since it is not necessary to apply a program potential to a cell unit in a non-selected block different from the flash memory having the two dimensional structure, it is not necessary to examine program disturb to the cell unit in the non-selected block. However, since the program potential is applied to a non-selected cell unit in a selected block, the BiCS memory has a special property in that program disturb occurs in the non-selected cell unit.

BRIEF SUMMARY OF THE INVENTION

A three dimensional stacked nonvolatile semiconductor memory according to an aspect of the present invention comprises a semiconductor substrate, a memory cell array comprised of first and second blocks disposed on the semiconductor substrate side by side in a first direction, and a first driver disposed on one end of the memory cell array in a second direction orthogonal to the first direction.

Each of the first and second blocks is comprised of at least three conductive layers stacked on the semiconductor substrate by being insulated from each other, a bit line disposed on the at least three conductive layers by being insulated therefrom, and columnar semiconductors having lower ends connected to the semiconductor substrate and upper ends connected to the bit line and passing through the at least three conductive layers.

An uppermost layer of the at least three conductive layers is comprised of first select gate lines extending in the second direction, a lowermost layer of the at least three conductive layers is a second select gate line, remaining conductive layers excluding the uppermost layer and the lowermost layer of the at least three conductive layers are a word line, and remaining conductive layers excluding the uppermost layer of the at least three conductive layers have a plate shape whose width in the first direction is larger than the width in the first direction of the first select gate lines.

Select gate transistors are comprised of the first select gate lines and the columnar semiconductors, and the second select gate line and the columnar semiconductors, respectively and memory cells are comprised of the word line and the columnar semiconductors, respectively.

The first block has a selected first cell unit including a memory cell to be programmed and a non-selected second cell unit not including a memory cell to be programmed, and programming is executed to the memory cell to be programmed by applying a program potential or a transfer potential lower than the program potential to the word lines in the first block after an initial potential of channels of the memory cells in the first and second cell units is set to a plus potential. The program potential and the transfer potential are not applied to the word lines in the second block in the programming.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

FIG. 1 is a bird's eye view of a BiCS-NAND flash memory;

FIG. 2 is a plan view of the BiCS-NAND flash memory;

FIG. 3 is a sectional view taken along line III-III of FIG. 2;

FIG. 4 is a sectional view taken along line IV-IV of FIG. 2;

FIG. 5 is a structure view of a NAND cell unit;

FIG. 6 is a bird's eye view of the NAND cell unit;

FIG. 7 is an equivalent circuit view of a memory cell array;

FIG. 8 is a comparative view comparing a BiCS-NAND with a two-dimensional NAND;

FIG. 9 is a view showing a first example of a block layout;

FIG. 10 is a view showing a second example of the block layout;

FIG. 11 is a view showing a third example of the block layout;

FIG. 12 is a view showing a fourth example of the block layout;

FIG. 13 is a view showing a layout of select gate lines on a bit line side;

FIG. 14 is a plan view when the select gate lines on the bit line side shown in FIG. 13 are arranged as a device;

FIG. 15 is a view showing a layout of select gate lines on a bit line side;

FIG. 16 is a plan view when the select gate lines on the bit line side shown in FIG. 15 are arranged as a device;

FIG. 17 is a view showing a layout of select gate lines on a bit line side;

FIG. 18 is a view showing an example of a driver circuit;

FIG. 19 is a view explaining program disturb of a BiCS memory;

FIG. 20 is a view explaining the program disturb of the BiCS memory;

FIG. 21 is a timing chart showing a first programming method;

FIG. 22 is a view showing a potential relation of the first programming method;

FIG. 23 is a view showing a potential relation of the first programming method;

FIG. 24 is a timing chart showing a second programming method;

FIG. 25 is a view showing a potential relation of the second programming method;

FIG. 26 is a view showing how an initial potential is set to a channel from the source line side;

FIG. 27 is a view showing a potential relation of the second programming method; and

FIG. 28 is a view showing a potential relation of the second programming method.

DETAILED DESCRIPTION OF THE INVENTION

A three dimensional stacked nonvolatile semiconductor memory of an aspect of the present invention will be described below in detail with reference to the accompanying drawing.

1. Outline

Examples of the present invention propose a programming method of preventing program disturb caused by a structure specific to the BiCS memory, i.e., a variation of a threshold value of a memory cell in a non-selected cell unit in a selected block.

When a feature of the BiCS memory is expressed simply from the viewpoint of a circuit, cell units are included in one block connected to one bit line. In this case, a non-selected cell unit, in which a memory cell to be programmed is not included, exists in a selected block in programming.

Further, in the BiCS memory, no program potential is applied to a cell unit in a non-selected block. However, since a program potential is applied to a non-selected cell unit in a selected block, it is necessary to prevent program disturb (variation of threshold value) to the non-selected cell unit.

Thus, the examples of the present invention propose to use a plus potential capable of improving program disturb as an initial potential of a channel of a memory cell in a non-selected cell unit in a selected block.

When the program potential or a transfer potential lower than the program potential is applied to a word line in the selected block, since the potential of the channel can be sufficiently increased by capacitance coupling by using the plus potential as the initial potential of the channel of the memory cell in the non-selected cell unit in the selected block, the program disturb can be improved.

Further, the BiCS memory has a structural feature in that a source diffusion layer common to blocks is formed in a semiconductor substrate. Accordingly, the plus potential can be applied from the source diffusion layer to the channel of the memory cell in the cell unit in the selected block as the initial potential in the programming.

This means that the initial potential of the channel can be set from the source diffusion layer during a period in which a bit line is set to a potential according to program data.

That is, since the bit line has a large capacity, it takes a long time to charge and discharge the bit line. However, this process cannot be omitted. When the plus potential is applied from the source diffusion layer to the channel of the memory cell, this operation can be performed in parallel with the operation for setting the bit line to the potential according to the program data. Therefore, no time penalty newly occurs.

2. BiCS Memory (1) Basic Structure

First, a basic structure of a BiCS memory will be explained.

FIG. 1 shows a bird's eye view of a BiCS-NAND flash memory.

The NAND flash memory is comprised of blocks each of which acts, for example, as a unit to be erased. Here, two blocks BK<i>, BK<i+1> are shown in the figure.

A source diffusion layer 24 formed in a semiconductor substrate is shared by, for example, all the blocks. The source diffusion layer 24 is connected to a source line SL·M1 through a contact plug P_(SL). Further, at least three conductive layers (in the example, six-layer structure) comprised of, for example, conductive polysilicon are laminated on the source diffusion layer 24.

The remaining five conductive layers excluding the uppermost layer are formed in a plate shape, respectively in the one block BK<i+1> as well as the ends thereof in the X-direction are formed stepwise so that they are in contact with the respective conductive layers. A lowermost layer acts as a select gate line SGS on the source line side, and the remaining four conductive layers excluding the lowermost and uppermost layers act as word lines WL<0>, WL<1>, WL<2>, and WL<3>.

The uppermost layer is comprised of line-shaped conductive wires extending in the X-direction. Six conductive wires, for example, are disposed in the one block BK<i+1>. The six conductive wires, for example, of the uppermost layer act as select gate lines SGD<0>, . . . , SGD<5> on a bit line side.

Active layers (active areas) AA for comprising a NAND cell unit are formed columnarly in a Z-direction (direction vertical to the front surface of the semiconductor substrate) so that they reach the source diffusion layer 24 passing through the conductive layers.

The upper ends of the active layers AA are connected to bit lines BL<0>, . . . , BL<m> extending in a Y-direction. Further, the select gate line SGS on the source line side is connected to an interconnect line SGS·M1 extending in the X-direction through a contact plug P_(SGS), and the word lines WL<0>, WL<1>, WL<2>, and WL<3> are connected to interconnect lines WL<0>·M1, WL<1>·M1, WL<2>·M1, and WL<3>·M1 extending in the X-direction through contact plugs P_(WL<0>), P_(WL<1>), P_(WL<2>), P_(WL<3>), respectively.

Further, select gate lines SGD<0>, . . . , SGD<5> on the bit line side are connected to interconnect lines SGD<0>·M1, . . . , SGD<5>·M1 extending in the X-direction through contact plugs P_(SGD<0>), . . . , P_(SGD<5>), respectively.

The bit lines BL<0>, . . . , BL<m> and the interconnect lines SGS·M1, WL<0>·M1, WL<1>·M1, WL<2>·M1, and WL<3>·M1, SGD<0>·M1, . . . , SGD<5>·M1 are comprised of, for example, metal.

FIG. 2 shows a plan view of the BiCS-NAND flash memory of FIG. 1.

The columnar active layers AA are disposed in an array-state when viewed from the upper surface of the semiconductor substrate and comprise a memory cell array 15. Although the NAND cell unit is formed in each of the active layers AA, it will be described later in detail.

WL drivers 11-i and 11(i+1) are connected to the word lines WL<0>, WL<1>, WL<2>, and WL<3> through the interconnect lines WL<0>·M1, WL<1>·M1, WL<2>·M1, WL<3>·M1 and drive them in write, in read, and in erase.

SGS drivers 12-i and 12-(i+1) are connected to the select gate line SGS on the source line side through the interconnect line SGS·M1. A SGD driver 13 is connected to the select gate lines SGD<0>, . . . , SGD<5> on the bit line side through the interconnect lines SGD<0>·M1, . . . , SGD<5>·M1.

An SL driver 14 is connected to the source diffusion layer 24 through the source line SL·M1.

In this layout, the WL drivers 11-i and 11-(i+1) and the SGS drivers 12-i and 12-(i+1) are disposed on one end side of the memory cell array 15 in the X-direction, and the SGD driver 13 is disposed on the other end side of the memory cell array 15 in the X-direction in consideration of an increase of the number of transistors comprising the drivers as a peripheral circuit.

FIG. 3 is a sectional view taken along line III-III of FIG. 2.

An N-type well region (N-well) 22 and a P-type well region (P-well) 23 are formed in a P-type semiconductor substrate (P-sub) 21. The source diffusion layer 24 is an N-type diffusion layer and formed in the P-type well region 23.

An N-channel FET (for example, N-channel MOSFET) 25 is formed in the P-type semiconductor substrate 21, and a P-channel FET (for example, P-channel MOSFET) 26 is formed in the N-type well region 22. These transistors comprise the peripheral circuit (for example, the drivers) formed in a peripheral portion of a memory cell array.

The select gate line SGS on the source line side and the word lines WL<0>, WL<1>, WL<2>, and WL<3> are connected to the transistors comprising the drivers through the interconnect line in a first metal layer M1 and through an interconnect line in a second metal layer M2 on the first metal layer M1.

To explain the word line WL<3> as an example, the word line WL<3> is connected to the N-channel FET 25 comprising a word line driver through the interconnect line WL<3>·M1 in the first metal layer M1 and through an interconnect line WL<3>·M2 in the second metal layer M2 on the first metal layer M1.

Here, gate electrodes of the N-channel FET 25 and the P-channel FET 26 are formed simultaneously with, for example, the select gate line SGS on the source line side.

That is, the gate electrodes of the N-channel FET 25 and the P-channel FET 26 have the same structure and the same thickness as those of the select gate line SGS on the source line side.

FIG. 4 is a sectional view taken along line IV-IV of FIG. 2.

One ends (lowermost portions) of the active layers (active areas) AA are connected to the source diffusion layer 24 passing through the select gate line SGS on the source line side, the word lines WL<0>, WL<1>, WL<2>, and WL<3>, and the select gate lines SGD<0>, . . . , SGD<5> on the bit line side, and the other ends (uppermost portions) thereof are connected to a bit line BL<0>.

The active layers AA are formed columnarly in the Z-direction (direction vertical to the front surface of the semiconductor substrate), and the NAND cell unit NAND is formed in each of the active layers AA.

FIG. 5 shows an example of a structure of the NAND cell unit NAND.

A memory cell MC has a MONOS structure.

The MONOS structure means a memory cell structure having a charge accumulation layer comprised of an insulation material such as nitride. The charge accumulation layer has a multilayer structure (charge trap layers), and ONO (oxide/nitride/oxide) is exemplified here.

A select gate transistor ST has the same structure as that of, for example, the memory cell MC.

However, a gate insulation film of the select gate transistor ST may have a structure different from that of the memory cell MC, i.e., may have a structure that includes no charge accumulation layer (for example, a single silicon oxide film).

FIG. 6 shows a bird's eye view of the NAND cell unit.

One of the features of the NAND cell unit having a three dimensional structure resides in that each of the select gate line SGS on the source line side, the word lines WL<0>, WL<1>, WL<2>, and WL<3>, and the select gate lines SGD<0>, . . . , SGD<5> on the bit line side has a structure for surrounding each of the side surfaces of the columnar active layers AA.

Accordingly, even if the active layers AA are made, for example, thinner and a larger number of the active layers AA are formed on the semiconductor substrate to increase a capacity, a force for driving the transistors comprising the NAND cell unit can be sufficiently secured.

FIG. 7 shows an equivalent circuit of the memory cell array.

Since the BiCS-NAND flash memory has a three dimensional structure, the equivalent circuit is shown in a three dimension.

A larger number of memory cells comprising a NAND column can more contribute to an increase of the capacity. However, as the number of the memory cells comprising the NAND column is more increased, there is a possibility that the characteristics of the memory cells are dispersed in a manufacturing process from the characteristics of a BiCS structure.

When the dispersion of the characteristics is taken into consideration, the NAND column is comprised of a smaller number of the memory cells (for example, four memory cells, eight memory cells, and the like). Further, the same structure may be stacked on a structure shown by the equivalent circuit of FIG. 7.

FIG. 8 is a view showing BiCS-NAND in comparison with two-dimensional NAND.

In the NAND flash memory having the two-dimensional structure (two-dimensional NAND), one NAND cell unit in one block is connected to one bit line BL, whereas in the BiCS-NAND, NAND cell units in one block are connected to one bit line BL.

Accordingly, as explained below, one of the cell units in the one block connected to the one bit line BL is selected by the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in a write operation and a read operation.

(2) Basic Operations

Basic operations of the BiCS-NAND flash memory of FIGS. 1 to 8 will be explained.

Since the basic write, read, and erase operations are the same as those of the NAND flash memory having the two-dimensional structure, matters specific to the BiCS-NAND flash memory will be explained here.

The concept of one block of the BiCS-NAND flash memory is different from that of the NAND flash memory having the two-dimensional structure.

Although the one NAND cell unit in the one block is connected to the one bit line BL in the NAND flash memory having the two-dimensional structure, the NAND cell units in the one block are connected to the one bit line BL in the BiCS-NAND flash memory.

For example, in the plan view of FIG. 2, six NAND cell units (corresponding to the number of the active layers AA in the figure) in the block BK<i+1> are connected to the bit line BL<0>.

Accordingly, in the write operation and the read operation, one of the six NAND cell units in the block BK<i+1> connected to the bit line BL<0> must be selected.

The selection is performed by select gate lines SGD<0>, . . . , SGD<5> on the bit line side. The select gate lines SGD<0>, . . . , SGD<5> on the bit line side are individually connected to six NAND cell units in the Y-direction in the block BK<i+1>.

The erase operation is performed collectively to, for example, all the memory cells in one block like the NAND flash memory having the two-dimensional structure.

The BiCS-NAND flash memory can be applied to both a binary memory, which stores binary data in one memory cell, and a multi-level memory which stores multi-level data having a ternary or more value in one memory cell.

3. Embodiments

Embodiments of the present invention will be explained.

(1) Block Layout

FIG. 9 shows a first example of a block layout of a BiCS memory.

The block layout corresponds to, for example, the BiCS-NAND flash memory of FIG. 1 and has a feature in that a select gate line driver on a bit line side is disposed on one end of a memory cell array and a word line driver and a select gate line driver on a source line side are disposed on the other end of the memory cell array.

A driver 33L, a level shifter 34L, and an address decoder 35L are disposed on one end of the memory cell array 31 in the X-direction (on a left side). The driver 33L is a driver for driving a select gate line SGD on the bit line side and includes a transfer transistor.

A driver 33R, a level shifter 34R, and an address decoder 35R are disposed on the other end of the memory cell array 31 in the X-direction (on a right side). The driver 33R is a driver for driving a word line WL and a select gate line SGS on the source line side and includes a transfer transistor. Further, a sense amplifier 32 is disposed on one end of the memory cell array 31 in the Y-direction. Bit lines BL<0>, . . . , BL<m> extending in the Y-direction are disposed on the memory cell array 31 and connected to the sense amplifier 32.

Since the block layout corresponds to the BiCS-NAND flash memory of FIG. 1, the select gate lines SGD on the bit line side are disposed in blocks BK<0>, BL<1>, . . . , BK<n>, respectively. That is, since the area of the driver for driving the select gate lines SGD on the bit line side is increased, a select gate line driver on the bit line side is disposed on the one end of the memory cell array 31 in addition to the word line driver and the select gate line driver on the source line side disposed on the other end of the memory cell array 31.

FIG. 10 shows a second example of the block layout of the BiCS memory.

A feature of the second example resides in that drivers 33 for driving a word line WL, a select gate line SGS on a source line side, and a select gate line SGD on a bit line side are disposed together on one end of a memory cell array 31 in comparison with the first example.

The drivers 33, a level shifter 34, and an address decoder 35 are disposed on one end of the memory cell array 31 in the X-direction (on a left side). The drivers 33 are drivers for driving the word line WL, the select gate line SGS on the source line side, and the select gate line SGD on the bit line side and include transfer transistors.

Further, a sense amplifier 32 is disposed on one end of the memory cell array 31 in the Y-direction. Bit lines BL<0>, . . . , BL<m> extending in the Y-direction are disposed on the memory cell array 31 and connected to the sense amplifier 32.

Since the drivers 33 for driving the word line WL, the select gate line SGS on the source line side and the select gate line SGD on the bit line side are disposed together as described above, an address decoder 34 and a level shifter 35 can be also disposed together at the same position. As a result, a layout of a peripheral circuit can be made efficiently.

However, in the above block layout, when the size of the memory cell array 31 is increased and further a memory cell is miniaturized and the word line WL, the select gate line SGS on the source line side and the select gate line SGD on the bit line side are disposed at narrow pitches, a problem arises in that a signal is delayed by a parasitic capacitance.

In particular, in the BiCS memory, the word line WL and the select gate line SGS on the source line side are formed in a plate shape. Accordingly, coupling noise is caused by an increase of a parasitic capacitance between the conductive wires.

FIG. 11 shows a third example of the block layout of the BiCS memory.

A feature of the third example resides in that two memory cell arrays 31L and 31R are disposed in comparison with the second example. Since disposition of the two memory cell arrays 31L and 31R can reduce the lengths of a word line WL, a select gate line SGS on a source line side, and a select gate line SGD on a bit line side in each memory cell array, a signal delay and coupling noise can be suppressed.

The memory cell arrays 31L and 31R are disposed in the X-direction side by side. Drivers 33L and 33R, a level shifter 34, and an address decoder 35 are interposed between the memory cell arrays 31L and 31R. The drivers 33L and 33R drive the word line WL, the select gate line SGS on the source line side, and the select gate line SGD on the bit line side and include transfer transistors.

Further, sense amplifiers 32L and 32R are disposed on one ends of the memory cell arrays 31L and 31R in the Y-direction. Bit lines BL<0>, . . . , BL<m> extending in the Y-direction are disposed on the memory cell arrays 31L and 31R and connected to the sense amplifiers 32L and 32R.

Although the level shifter 34 and the address decoder 35 are shared by the two memory cell arrays 31L and 31R in the block layout, the driver 33L is disposed in correspondence with the memory cell array 31L, and the driver 33R is disposed in correspondence with the memory cell array 31R.

A reason why the drivers 33L and 33R cannot be shared by the two memory cell arrays 31L and 31R as described above is that since the drivers are comprised of a lot of transistors, when they are shared by the memory cell arrays 31L and 31R, a layout of wirings for connecting the memory cell arrays 31L and 31R to the drivers 33L and 33R is made complex. That is, when a driver is disposed on each memory cell array, a layout of wirings is more simplified than a case that the drivers are shared by the two memory cell arrays.

FIG. 12 shows a fourth example of the block layout of the BiCS memory.

A feature of the fourth example resides in that a driver 33 for driving a word line WL, a select gate line SGS on a source line side, and a select gate line SGD on a bit line side is shared by two memory cell arrays 31L and 3IR in comparison with the third example.

The driver 33 can be shared by reducing the area thereof, i.e., by reducing the number of transistors comprising the driver 33. More specifically, this is because a layout of wirings for connecting the memory cell arrays 31L and 31R to the driver 33 is not made complex by reducing the number of the transistors. Although the number of the transistors comprising the driver 33 is reduced by a layout of the select gate line on the bit line side according to the present invention, this will be described later and only the block layout will be explained here.

The memory cell arrays 31L and 31R are disposed in the X-direction side by side. The driver 33, a level shifter 34, and an address decoder 35 are interposed between the memory cell arrays 31L and 31R. The driver 33 drives the word line WL, the select gate line SGS on the source line side and the select gate line SGD on the bit line side and includes a transfer transistor.

Further, sense amplifiers 32L and 32R are disposed on one ends of the memory cell arrays 31L and 31R in the Y-direction. Bit lines BL<0>, . . . , BL<m> extending in the Y-direction are disposed on the memory cell arrays 31L and 31R and connected to the sense amplifiers 32L and 32R.

In the block layout, the driver 33, the level shifter 34, and the address decoder 35 are shared by the two memory cell arrays 31L and 31R.

Note that if the driver 33 can be shared by the two memory cell arrays 31L and 31R without making a wiring layout complex, the block layout of the fourth example will be most preferable in the first to fourth examples.

(2) Layout of Select Gate Line on Bit Line Side

FIG. 13 shows a first example of a layout of select gate lines on a bit line side. FIG. 14 shows a layout viewed on a plan view when the select gate lines shown in FIG. 13 are arranged as a device.

The first example corresponds to the block layout of FIG. 9. That is, a driver 33L connected to select gate lines SGD<0>, . . . , SGD<5> on a bit line side is disposed on one end (left side) of a memory cell array 31 in the X-direction as well as disposed independently of a driver 33R connected to word lines WL<0>, . . . , WL<3> and to a select gate line SGS on a source line side.

Each of two blocks BK<i>, BK<i+1> is comprised of at least three conductive layers, which are insulated from each other and stacked on a semiconductor substrate, bit lines BL<0>, . . . , BL<m>, which are insulated from the at least three conductive layers and disposed thereon, and active layers (columnar semiconductors) AA whose lower ends are connected to the semiconductor substrate, whose upper ends are connected to the bit lines BL<0>, . . . , BL<m>, and which pass through the at least three conductive layers.

The uppermost layer of the at least three conductive layers is comprised of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side, the lowermost layer of the at least three conductive layers is the select gate line SGS on the source line side, and the remaining conductive layers excluding the uppermost and lowermost layers of the at least three conductive layers are the word lines WL<0>, . . . , WL<3>.

In the first example, although the number of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side is six and the number of the word lines WL<0>, . . . , WL<3> is four in one block, respectively, the numbers are not limited thereto. That is, it is sufficient that the number of the select gate lines on the bit line side and the number of the word lines be at least one in the one block, respectively.

Further, the remaining conductive layers excluding the uppermost layer of the at least three conductive layers have a plate shape whose width in the Y-direction is larger than that in the Y-direction of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side.

Select gate transistors on the bit line side are comprised of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side and the active layers AA, and select gate transistors on the source line side are comprised of the select gate line SGS on the source line side and the active layers AA. Further, memory cells are comprised of the word lines WL<0>, . . . , WL<3> and the active layers AA.

Further, the region between the memory cell array 31 and the driver 33L is arranged as an interconnect portion 36L in which interconnect lines (conductive wires) SGD<0>·M1, . . . , SGD<5>·M1 are disposed to connect the memory cell array 31 to the driver 33L. Likewise, the region between the memory cell array 31 and the driver 33R is arranged as an interconnect portion 36R in which interconnect lines (conductive wires) WL<0>·M1, . . . , WL<3>·M1, and SGS·M1 are disposed to connect the memory cell array 31 to the driver 33R.

The select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i> and the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i+1> are connected to the driver 33L after they are commonly connected in the relation of one to one in one end in the X-direction of the memory cell array 31.

Specifically, an i-th (i is a natural number) select gate line on the bit line side from the block BK<i+1> side of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i> is commonly connected to an i-th select gate line on the bit line side from the block BK<i> side of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i+1>.

Accordingly, the select gate lines SGD<0>, . . . , SGD<5> on the bit line side have a folded layout in their entirety.

The folded layout can be easily formed by making use of, for example, a side wall masking technology for etching a ground layer using a side wall as a mask.

FIG. 15 shows a second example of a layout of select gate lines on a bit line side. FIG. 16 shows a layout viewed on a plan view when the select gate lines shown in FIG. 15 are arranged as a device.

The second example corresponds to the block layout of FIGS. 10 to 12. That is, a driver 33 (33L and 33R) disposed on one end (right side) of a memory cell array 31 in the X-direction are connected to word lines WL<0>, . . . , WL<3>, a select gate line SGS on a source line side, and select gate lines SGD<0> . . . , SGD<5> on a bit line side.

Attention must be paid to the fact that the portions, to which the layouts of FIGS. 15 and 16 are applied as they are, are limited to the portion between the memory cell array 31L and the driver 33L of FIG. 11 and to the portion between the memory cell array 31L and the driver 33 of FIG. 12.

The layouts, which are obtained by reversing the layouts of FIGS. 15 and 16 in a right to left direction, are applied to the remaining portions between the memory cell array 31 and the driver 33 of FIG. 10, between the memory cell array 31R and the driver 33R of FIG. 11, and between the memory cell array 31R and the driver 33 of FIG. 12.

Each of two blocks BK<i>, BK<i+1> is comprised of at least three conductive layers, which are insulated from each other and stacked on a semiconductor substrate, bit lines BL<0>, . . . , BL<m>, which are insulated from the at least three conductive layers and disposed thereon, and active layers (columnar semiconductors) AA whose lower ends are connected to the semiconductor substrate, whose upper ends are connected to the bit lines BL<0>, . . . , BL<m>, and which pass through the at least three conductive layers.

The uppermost layer of the at least three conductive layers is comprised of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side, the lowermost layer of the at least three conductive layers is a select gate line SGS on the source line side, and the remaining conductive layers excluding the uppermost and lowermost layers of the at least three conductive layers are the word lines WL<0>, . . . , WL<3>.

In the second example, although the number of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side is six and the number of the word lines WL<0>, . . . , WL<3> is four in one block, respectively, the numbers are not limited thereto. That is, it is sufficient that the number of the select gate lines on the bit line side and the number of the word lines be at least one in the one block, respectively.

Further, the remaining conductive layers excluding the uppermost layer of the at least three conductive layers have a plate shape whose width in the Y-direction is larger than the width in the Y-direction of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side.

Select gate transistors on the bit line side are comprised of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side and the active layers AA, and select gate transistors on the source line side are comprised of the select gate line SGS on the source line side and the active layers AA. Further, memory cells are comprised of the word lines WL<0>, . . . , WL<3> and the active layers AA.

Further, the region between the memory cell array 31 (31L and 31R) and the driver 33L (33L and 33R) is arranged as an interconnect portion 36 in which interconnect lines (conductive wires) WL<0>·M1, . . . , WL<3>·M1, SGS·M1, SGD<0>·M1, . . . , SGD<5>·M1 are disposed to connect the memory cell array 31 to the driver 33.

The select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i> and the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i+1> are connected to the driver 33 (33L and 33R) after they are commonly connected in the relation of one to one in one end in the X-direction (right side) of the memory cell array 31.

Specifically, an i-th (i is a natural number) select gate line on the bit line side from the block BK<i+1> side of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i> is commonly connected to an i-th select gate line on the bit line side from the block BK<i> side of the select gate lines SGD<0>, . . . , SGD<5> on the bit line side in the block BK<i+1>.

Accordingly, the select gate lines SGD<0>, . . . , SGD<5> on the bit line side have a folded layout in their entirety.

The folded layout can be easily formed by making use of, for example, a side wall masking technology for etching a ground layer using a side wall as a mask.

FIG. 17 shows a layout in which drivers are disposed on both the sides of a memory cell array.

As shown in FIG. 17, when the drivers 33 are disposed on both the sides of the memory cell array 31, the size of the drivers 33 in the Y-direction per, for example, one block can be increased (the number of transistors can be increased). As a result, since the size of the drivers 33 in the X-direction can be decreased (the number of transistors can be decreased), the layout of interconnect lines (conductive wires) WL<0>·M1, . . . , WL<3>·M1, SGS·M1, SGD<0>·M1, . . . , SGD<5>·M1 in an interconnect portion 36 is further simplified.

Note that whether the drivers 33 (33L and 33R) are disposed on one sides of the memory cell arrays 31 (31L and 31R) as shown in FIGS. 10 to 12 or the drivers 33 are disposed on both the sides of the memory cell array 31 as shown in FIG. 17 is determined in consideration of the specification of the BiCS memory (chip), the area efficiency of the peripheral circuit, and the like.

(3) Example of Driver Circuit

An example of a driver circuit will be explained using a BiCS-NAND flash memory as an example.

FIG. 18 shows the example of the driver circuit.

It is assumed that each of memory cell arrays is arranged such that four word lines are disposed in one block (four layers), eight select gate lines (one layer) are disposed on a bit line side, and one select gate line (one layer) is disposed on a source line side.

A driver 33 is comprised of a transfer transistor (high voltage transistor) to which a high voltage is applied. Each of row decoders 35 is comprised of an AND circuit and decodes an address signal ADDRESS. Level shifters 34 are connected between the driver 33 and the row decoders 35.

BSTON, VRDEC, RDECANDn<0>, and RDECANDn<1> are control signals for turning on and off transfer transistors, and SGD<7:0>, CGi<3:0>, CG(i+1)<3:0>, SGSi, SGS(i+1), VRDEC2, and SGDS are transfer voltages.

(4) Program Disturb and Programming Method A. Program Disturb

First, program disturb specific to a BiCS memory will be explained using a BiCS-NAND flash memory as an example.

FIG. 19 shows three blocks.

It is assumed that a block BK<i> is a selected block and blocks BK<i−1> and BK<i+1> are non-selected blocks.

The selected block BK<i> has a selected NAND cell unit NAND-select including a memory cell to be programmed. The NAND cell unit NAND-select is located at the intersection point where a selected bit line BL-select and a selected select gate line SGD-select on the bit line side intersect with each other.

In programming, a program potential Vpmg is applied to a selected word line WL-select in the selected block BK<i>, and a transfer potential Vpass lower than the program potential Vpmg is applied to non-selected word lines WL-unselect in the selected block BK<i>.

Since the NAND cell units in the block BK<i> share the word lines, the program potential Vpmg and the transfer potential Vpass are applied also to the non-selected NAND cell units other than the selected NAND cell unit in the block BK<i>.

When the programming is performed, the relation of the potentials in the block BK<i> is as shown in FIG. 20.

The program potential Vpmg is applied to the selected word line WL-select, and the transfer potential Vpass is applied to the non-selected word lines WL-unselect. Further, Vsgd (for example, about 4 V) is applied to a selected select gate line SGD-select on the bit line side as a potential for turning on select gate transistors on the bit line side, and Vss (for example, 0 V) is applied to non-selected selected select gate lines SGD-unselect on the bit line side as a potential for turning off the select gate transistors on the bit line side. A potential according to program data (“0” or “1”) is applied to the selected bit line BL-select.

At this time, program disturb is liable to occur particularly in a non-selected memory cell X1 connected to the word line WL-select to which the program potential Vpgm is applied. A channel boost technology is applied to improve the program disturb.

In, for example, a NAND type flash memory having a two dimensional structure, the channel boost technology is applied to a memory cell subjected to write prohibition (“1”-programming) to prevent an increase of the threshold value of the memory cell. However, in a BiCS-NAND flash memory, the channel boost technology is applied to a non-selected NAND cell unit in a selected block as described above unlike the NAND type flash memory having the two dimensional structure.

Thus, the channel boost technology is applied in consideration of a structure (operation) specific to the BiCS-NAND flash memory.

B. First Programming Method

FIG. 21 is a timing chart showing a first programming method.

First, word lines WL<0>, WL<1>, WL<2>, WL<3>, bit lines BL<0>, BL<1>, select gate lines SGD on a bit line side, source lines SL, and select gate lines SGS on a source line side are set to VSS (for example, 0 V) in all the blocks.

That is, as shown in FIG. 22, the initial potentials of the channels of NAND cell units NAND-select1, NAND-select2, and NAND-unselect in a selected block BK<i> are set to Vss.

Thereafter, the bit lines BL<0>, BL<1> are set to values according to program data as well as the select gate lines SGD on the bit line side in the selected NAND cell units NAND-select1 and NAND-select2 are set to Vsgd (line A). Vsgd shows a potential of, for example, about 4 V by which the program data can transferred. The select gate line SGD on the bit line side in the non-selected NAND cell unit NAND-unselect remains Vss (line B).

Further, when the program data is set to “0”, the bit lines are set to Vss (for example, 0 V), whereas when the program data is set to “1”, the bit lines are set to Vdd (for example, plus potential).

As shown in FIG. 23, it is assumed here that “0”-programming (write) is performed to a selected memory cell MC-select1 in the selected block BK<i>, and “1”-programming (write prohibition) is performed to a selected memory cell MC-select2 in the selected block BK<i>.

In the NAND cell unit NAND-select1, since select gate transistors on the bit line side are turned on, the potential Vss of the bit line BL<0> is transferred to the channel of the memory cell in the NAND cell unit NAND-select1. Accordingly, when a program potential Vpgm is applied to a word line WL<2> in the selected block BK<i>, write (an increase of threshold value) is permitted to the selected memory cell MC-select1.

In contrast, in the NAND cell unit NAND-select2, when a transfer potential Vpass is applied to the word lines WL<0>, WL<1>, WL<2>, and WL<3> in the selected block BK<i>, the select gate transistors on the bit line side are turned off. Accordingly, when the program potential Vpgm is applied to the word line WL<2> in the selected block BK<i>, the channel potential of the selected memory cell MC-select2 is boosted, and the write (an increase of threshold value) is prohibited to the selected memory cell MC-select2. Further, in the non-selected NAND cell unit NAND-unselect in the selected block BK<i>, the select gate transistors on the bit line side and select gate transistors on the source line side remain turned off together. Accordingly, when the transfer potential Vpass and the program potential Vpgm are applied to the word lines WL<0>, WL<1>, WL<2>, WL<3> in the selected block BK<i>, the channel potential of the memory cell in the non-selected NAND cell unit NAND-unselect is boosted. Thus, the program disturb (variation of threshold value) can be prevented.

The first programming method improves the program disturb to the memory cell in the non-selected NAND cell unit NAND-unselect in the selected block BK<i>. However, the first programming method cannot sufficiently improve the program disturb to a non-selected memory cell X1 to which the program potential Vpgm is applied.

C. Second Programming Method

A second programming method proposes a technology for improving the program disturb also to the non-selected memory cell X1 to which the program potential Vpgm is applied by sufficiently increasing the channel potential of the memory cell in the non-selected NAND cell unit NAND-unselect.

FIG. 24 is a timing chart showing the second programming method.

First, the word lines WL<0>, WL<1>, WL<2>, WL<3>, the bit lines BL<0>, BL<1>, the select gate lines SGD on the bit line side, the source lines SL, and the select gate lines SGS on the source line side are set to VSS (for example, 0 V) in all the blocks.

At this time, as shown in FIG. 25, the initial potentials of the channels of the NAND cell units NAND-select1, NAND-select2, and NAND-unselect in the selected block BK<i> are set to Vss.

Thereafter, the bit lines BL<0>, BL<1> are set to a value according to the program data as well as all the select gate lines SGS on the source line side in the selected block BK<i> are set to Vsgs. Vsgs is a value by which a precharge potential to be described later can be transferred, i.e., a potential of, for example, about 4 V. The select gate lines SGS on the source line side in the non-selected blocks BK<i−1>, BK<i+1> remain Vss.

Further, a potential VP-pre, by which a memory cell is turned on regardless of the state of the threshold value thereof is applied to all the word lines WL<0>, WL<1>, WL<2>, WL<3> in the selected block BK<i>. The potential VP-pre is a potential which is the same as or nearly equal to, for example, a read potential Vread which is applied to a non-selected memory cell in read.

The plus potential, for example, Vdd is applied to a source line SL as a precharge potential, further all the select gate lines SGD on the bit line side remain Vss, and the select gate transistors on the bit line side remain turned off.

As a result, the initial potentials of the channels of the memory cells in all the NAND cell units NAND-select1, NAND-select2, and NAND-unselect in the selected block BK<i> are precharged to the plus potential, for example, Vdd as shown FIGS. 26 and 27.

The precharge to the channels are performed in parallel with the operation for setting the bit lines BL<0>, BL<1> to the value according to the program data.

The bit lines are set to Vss (for example, 0 V) when the program data is set to “0” and set to Vdd (for example, plus potential) when the program data is set to “1”.

Thereafter, the select gate lines SGD on the bit line side in the selected NAND cell units NAND-select1, NAND-select2 are set to Vsgd (line A). Vsgd is a value by which the program data can be transferred, i.e., a potential of, for example, about 4 V. The select gate line SGD on the bit line side in the non-selected NAND cell unit NAND-unselect remains Vss (line B).

As shown in FIG. 28, it is assumed here that the “0”-programming (write) is performed to the selected memory cell MC-select1 in the selected block BK<i>, and the “1”-programming (write prohibition) is performed to the selected memory cell MC-select2 in the selected block BK<i>.

In the NAND cell unit NAND-select1, since select gate transistors on the bit line side are turned on, the potential Vss of the bit line BL<0> is transferred to the channel of the memory cell in the NAND cell unit NAND-select1. Accordingly, when the program potential Vpgm is applied to the word line WL<2> in the selected block BK<i>, write (an increase of threshold value) is permitted to the selected memory cell MC-select1.

In contrast, in the NAND cell unit NAND-select2, when the transfer potential Vpass is applied to the word lines WL<0>, WL<1>, WL<2>, and WL<3> in the selected block BK<i>, the select gate transistors on the bit line side are turned off. Accordingly, when the program potential Vpgm is applied to the word line WL<2> in the selected block BK<i>, the channel potential of the selected memory cell MC-select2 is boosted, and the write (an increase of threshold value) is prohibited to the selected memory cell MC-select2.

Further, in the non-selected NAND cell unit NAND-unselect in the selected block BK<i>, the select gate transistors on the bit line side and the select gate transistors on the source line side remain turned off together. Accordingly, when the transfer potential Vpass and the program potential Vpgm are applied to the word lines WL<0>, WL<1>, WL<2>, WL<3> in the selected block BK<i>, the channel potential of the memory cell in the non-selected NAND cell unit NAND-unselect is boosted. Thus, the program disturb (variation of threshold value) can be prevented.

In the second programming method, the initial potentials of the channels of the NAND cell units NAND-select1, NAND-select2, NAND-unselect in the selected block BK<i> are set to the plus potential as compared with the first programming method. As a result, since the channel potential of the memory cell in the non-selected NAND cell unit NAND-unselect is sufficiently boosted, the program disturb can be sufficiently improved to the non-selected memory cell X1 to which the program potential Vpgm is applied.

D. Others

In the first and second programming methods, the potential of the select gate lines SGD on the bit line side is set to a potential larger than Vss and smaller than Vsgd, for example, about 2 V after it is set to Vsgd (for example, 4 V) as shown in FIGS. 21 and 24. The value is a value by which “0” (=Vss) can be transferred. However, the potential of the select gate lines SGD on the bit line side is not limited to the above value and may remain, for example, Vsgd or may be dropped from Vsgd to Vss.

Further, in the second programming method, the potential of the select gate lines SGS on the source line side is set to Vsgs (for example, 4 V) and then dropped to Vss again as shown in FIG. 24. However, the potential of the select gate lines SGS on the source line side is not limited to the above value and may be changed from Vsgs to, for example, a plus potential slightly higher than Vss (line C). In this case, the plus potential is a potential that can securely turn off the select gate transistors on the source side.

Further, in the second programming method, the potential of the word lines WL<0>, WL<1>, WL<2>, WL<3>, may be set from VP-pre to Vpass and further from Vpass to Vpgm. Further, the potential may be dropped from VP-pre to Vss or to a value near Vss once and then may be set to Vpass and further to Vpgm as shown in FIG. 24.

(5) Conclusion

As described above, according to the embodiment of the present invention, program disturb of a three dimensional stacked nonvolatile semiconductor memory to which a BiCS technology is applied can be improved.

4. Application Example

Although the technology of the present invention is effective for a BiCS-NAND flash memory in which one cell unit is comprised of memory cells (NAND columns) connected to each other in series to realize bit cost scalability, the technology can be also applied to a three dimensional stacked nonvolatile semiconductor memory to which the BiCS technology is applied in addition to the above BiCS-NAND flash memory.

For example, the technology of the present invention is also effective for a nonvolatile semiconductor memory which has exactly the same device structure as that of the BiCS-NAND flash memory but in which only one central memory cell of memory cells in one cell unit is used as a memory cell and the remaining memory cells are used as dummy cells as an example other than the BiCS-NAND flash memory.

Further, as to a memory cell structure of the BiCS memory, it is considered that a so-called MONOS type, in which a charge accumulation layer is comprised of an insulation material (for example, nitride), is effective, but the example of the present invention is not limited thereto and can be also applied to a floating gate type in which a charge accumulation layer is comprised of conductive polysilicon.

Further, a data value stored in one memory cell may be a binary value (two-level) or a multivalue (multi-level) of at least a ternary value (three-level).

5. Advantages

According to the present invention, program disturb of a three dimensional stacked nonvolatile semiconductor memory to which a BiCS technology is applied can be improved.

Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details and representative embodiments shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents. 

What is claimed is:
 1. A nonvolatile semiconductor memory comprising: a semiconductor substrate having a main surface extending in a first direction and a second direction crossing the first direction; a plurality of word lines being stacked in a third direction crossing the first direction and the second direction above the semiconductor substrate, and each extending in the first direction; a first memory cell array including a plurality of memory strings, the memory strings including a first memory string and a second memory string, the first memory string including a first select gate transistor, a second select gate transistor and a plurality of first memory cell transistors between the first select gate transistor and the second select gate transistor, gates of the first memory cell transistors are connected to the word lines, respectively, the second memory string including a third select gate transistor, a fourth select gate transistor and a plurality of second memory cell transistors between the third select gate transistor and the fourth select gate transistor, gates of the second memory cell transistors are connected to the word lines, respectively; a bit line connected to one end of the first memory string and one end of the second memory string; a source line connected to the other end of the first memory string and the other end of the second memory string; wherein when a program operation on one of the first memory cell transistors is performed, a first voltage is applied to the word line on a first timing, a second voltage is applied to gates of the second transistor and gates of the fourth transistor at the first timing to turn on the second select gate transistor and the fourth transistor, a third voltage is applied to the gates of the second transistor and the gates of the fourth transistor at a second timing after the first timing to turn off the second select gate transistor and the fourth transistor, a fourth voltage is applied to one of the word lines corresponding to the one of the first memory cell transistors at a third timing after the second timing, and a fifth voltage lower than the fourth voltage is applied to the other ones of the word lines at the third timing.
 2. The non-volatile semiconductor memory according to claim 1, further comprising: a first select gate line extending in the first direction above the word lines, and being connected to a gate of the first select gate transistor; a second select gate line extending in the first direction above the word lines, and connected to a gate of the third select gate transistor; and a third select gate line extending in the first direction below the word lines and above the semiconductor substrate, and connected to the gate of the second select gate transistor and the gate of the fourth select gate transistor.
 3. The non-volatile semiconductor memory according to claim 2, wherein in the program operation, the second voltage is applied to the third select gate line at the first timing, and the third voltage is applied to the third select gate line at the second timing.
 4. The non-volatile semiconductor memory according to claim 3, wherein in the program operation, a sixth voltage higher than the third voltage is applied to the first select gate line at a fourth timing after the second timing and before the third timing, and a seventh voltage lower than the sixth voltage and higher than the third voltage is applied to the first select gate line at a fifth timing after the fourth timing and before the third timing.
 5. The non-volatile semiconductor memory according to claim 4, wherein in the program operation, the fifth voltage is applied to the word lines at a sixth timing after the fifth timing and before the third timing.
 6. The non-volatile semiconductor memory according to claim 1, wherein an eighth voltage is applied to the source line at the first timing.
 7. The non-volatile semiconductor memory according to claim 6, wherein the first memory cell transistors include a first channel, and the second memory cell transistors include second channel, and the eighth voltage is a positive voltage such that at least the second channel of the second memory cell transistors has the positive voltage after the first timing and before the second timing.
 8. The non-volatile semiconductor memory according to claim 6, wherein the second voltage is higher than the sixth voltage such that a voltage is transmitted from the source line to the second channel of the second memory cell transistors of the second memory string.
 9. The non-volatile semiconductor memory according to claim 1, wherein the fourth voltage is a program voltage to increase a threshold voltage of the one of the first memory cell transistors of the first memory string.
 10. The non-volatile semiconductor memory according to claim 1, wherein each of the word lines has a planar shape extending in the first direction and the second direction.
 11. The non-volatile semiconductor memory according to claim 1, wherein a ninth voltage or a tenth voltage is applied to the bit line at the first timing, the ninth voltage or the tenth voltage is switched depending on data to be programmed into the one of the first memory cell transistors. 